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EFFICIENT IN-BAND SIGNALING FOR DISCONTINUOUS 

TRANSMISSION AND CONFIGURATION CHANGES IN 
ADAPTIVE MULTI-RATE COMMUNICATIONS SYSTEMS 



5 Cross-Reference to Related AppKcations 

The present application claims the benefit of U.S. Provisional Application 
No. 60/109,694, filed November 24, 1998, which is incorporated herein in its 
entirety by reference. 

10 Field of the Invention 

The present invention relates to communications systems^ and more 
particularly, to discontinuous transmission (DTX) and configuration changes in 
adaptive multi-rate communications systems. 

15 Background of the Invention 

Today, multi-mode coding systems employing at least two different source 
and channel codec modes can be used to maintain near-to-optimum commxmication 
quality under varying transmission channel conditions. A mode with low source 
coding bit rate and a high degree of channel error protection can be chosen for 
20 bad channels. On the other hand, good channels allow selection of a codec mode 

with high source coding bit rate and a relatively low degree of error protection. 



As is wen known in the art. such multi-mode coding systems must convey 
(either explicitly or impUcitly) the actually chosen codec mode to a receiving 
decoder to enable proper decoding of received data. Two-way communication 
systems with codec mode adaptation have additionaUy to transmit similar 
information over the return link. This is either quantized link measurement data 
describing die present forward chamiel state, or a corresponding codec mode 
request/command taking the chamiel state account. Such link adaptation data is 
known in the art as codec mode information, consisting of codec mode indications 
(the actually selected codec mode) and codec mode requests/commands (the codec 
mode to be used on the transmitting side). The evolving Global System for 
Mobfle Commmiication (GSM) Adaptive Multi-Rate (AMR) standard employs the 
above described codec mode adaptation. 

In such AMR systems, in-band signaling is used to reallocate parts of the 
speech transmission resource for transmitting control infonnation. It is applied 
where no other suitable control chamiels are avaUable. The GSM AMR speech 
coding standard is an example which makes use of in-band signaling. It uses parts 
of the GSM speech traffic chamiel for die transmission of AMR link adaptation 
data. More specificaUy. the GSM AMR standard provides an in-band chamiel for 
the transmission of codec mode information. 

Codec mode information consists of codec mode requests/commands and 
codec mode indications, which are transmitted every second frame (every 40 ms). 



in alternating order. Codec mode information identifies a codec mode in a subset 
of up to 4 codec modes out of 8 (for adaptive fuU-rate speech, or AFS) or 6 (for 
adaptive half-rate speech. AHS) available modes. These codec mode subsets are 
referred to as active codec sets. 

In any communication system, including the above described GSM AMR 
system, transmission capacity is a limited and costly resource. For this reason, in 
order to save transmission capacity, Discontinuous Transmission (DTX) is widely 
applied when transmitting speech. Sometimes DTX is referred to as Voice 
(derated Transmission (VOX). The basic principle of DTX is to turn off 
transmission during speech inactivity. Instead, so-called comfort noise (CN) 
parameters are transmitted which enable the decoder to reproduce the mactivity 
signal, which usually is some kind of background noise. CN parameters require 
much less transmission resource than speech. DTX is also an important feature 
for mobUe telephones as it aUows turning off power consuming devices (such as 
radio transmitters) during inactivity. Doing so helps to save battery power and to 
increase the talk time of the phones. 

In two-way communication systems employing DTX, there will typicaUy 
be one link active while the other link is inactive (as one speaker is talking while 
the other is listening). The active link has. with some reduced frame transmission 
rate, to convey silence descriptor (SID) frames (also known as background 
information, or comfort noise, descriptor frames) to the receiver. SID frames 



contain CN parameters and enable a receiver to generate a comfort noise silence 
signal, for example to reassure a listening user that the connection is still active. 

In the present GSM speech coding standards FR, HR and EFR, DTX is 
realised m a very similar way. By way of example, the state of the art of DTX 
operated speech communication in the GSM system wiU be described with respect 
to the GSM EFR codec. For additional information, see for example die 
GSM 06.11. GSM 06.12, GSM 06.21, GSM 06.22, GSM 06.31, GSM 06.41, 
GSM 06.61, GSM 06.62, and GSM 06.81 standards, and related documents. The 
GSM EFR scheme is characterised as follows: 

End of speech activity is signaled by the transmission of a first SID frame, 
which is not phase-aligned to the SACCH. Rather, it is immediately following the 
last active speech frame. After such a first SID frame, update SID frames are 
transmitted with a period of once per 24 frames (=480ms). Update SID frame 
transmission is aligned with the time alignment flag (TAF), which is generated in 
the radio subsystems and which is derived from the SACCH frame structure. 
Apart from SID frames, no other frames are transmitted during inactivity. Simply 
resuming the transmission of active speech frames ends the inactivity period. 

The RSS handles SID frames as regular speech frames. This means in 
particular that die same channel coding and diagonal interleaving is used as for 
speech frames. A number of effectively fourty-three (43) net bits is used for the 
comfort noise parameters which describe spectral shape and gain of the inactivity 
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signal. Ninety-five (95) net bits are used for a special SID bit pattern to identify 
the firame as a SID frame and to make it distinct from speech frames. CN 
parameters are differentiaUy encoded with respect to parameters, which are 
derived from the last transmitted speech frames. 

The described SID frame transmission is iUustrated in Figure 1 for 
TCH/FS (i.e., traffic channel / fiiU-rate speech) and in Figure 2 for TCH/HS (i.e., 
traffic channel / half-rate speech). The upper row symboUses the speech frames, 
as they are seen at the input of the speech encoder. The middle row symboUses 
the TDMA frames that transmit the respective speech or SID bits via the radio 
interface. The lower row symbolises the speech or comfort noise frames after the 
speech decoder. Every speech frame is exactly 20 ms long. The TDMA frames 
have in average a distance of exacUy 5 ms. TDMA frames for SACCH and IDLE 
are not shown. Implementation delays and other side effects are not shown either. 

Apart from regular transmission of SID frames, synchronously and time 
aligned to a fixed time structure, ITU-T recommendation G.729/Annex B 
describes a DTX method which transmits SID frames whenever an update of the 
CN parameters is required because they have changed significantly since the last 
SID frame transmission. 

In the weU known Pacific Digital Cellular (PDC) system with VOX 
functionaUty, special post- and pre-amble frames are used to signal transitions 
from speech to inactivity or, respectively, back from inactivity to speech (see. for 



example. RCR STD-27D). These frames contain unique bit patterns on gross bit 
level to identify them. Post-amble frames consist of two chamiel frames of which 
the first carries no other infonnation than the identification bit pattern and of 
which the second carries comfort noise parameters describing the inactivity si^. 
During voice inactivity, post-amble frames are sent periodically to enable the 
receiving end to update the comfort noise generation. For both post- and pre- 
amble fi^es, the same interleaving is used as for speech frames. 

The above described conventional DTX solutions, as realized in GSM FR. 
EFR, and HR. are not well suited for use in multi-mode coding systems. This 
results from the feet that SID frame signaling is done on net bit level. A special 
bit pattern identifying the SID frame is part of die net bit stream. The SID frame 
detection unit at the receiver is executed after de-interleaving and channel 
decoding. This approach is inappropriate for multi-mode coding systems with 
more than one source and channel mode since the SID frame identification would 
depend on the correct choice of the codec mode for channel decoding. The 
correct codec mode at the receiver can. due to possible mode fransmission errors, 
not always.be guaranteed. 

Moreover, for analogue reasons, variations of the interleaving scheme, 
either for die different codec modes or for SID frames, are also impractical, for 
complexity reasons. Such approaches require in the worst case to run SID frame 



de-interleaving and. more severe, channel decoding in addition to speech frame 
de-interleaving and channel decoding. 

Additionally, there are at least two major problems in adopting the PDC 
realization. Firstly, as post-amble frames consist of two ti-affic frames, the 
inactivity ti-ansmission mode is relatively inefficient in terms of ti:ansmission 
power savings. Each comfort noise parameter update requires die tiransmission of 
two iframes. Secondly, as transitions from speech inactivity to activity are 
signaled by pre-amble frames, either parts of the speech onsets may be clipped or 
tiransmission of speech onsets is resumed delayed by die pre-amble frame. The 
former effect direcfly degrades die quality of die reconstructed speech while die 
latter mcreases the speech transmission delay which may cause degradations of the 
conversational quality. 

Note also that applying a common diagonal interleaving scheme over two 
frames for SID and speech frames, as is presentiy done m bodi GSM and PDC, 
causes further problems. Applying diagonal interleaving for transmission of 
single SID frames is inefficient in terms of radio resource usage and power 
consumption since only one half of every ti-ansmitted TDMA frame carries SID 
information while the other half remains unused and is dius wasted (such wasted 
half bursts are marked in Figures 1 and 2). 

This efficiency loss in current GSM and PDC systems is small as SBD 
frame ti-ansmission is relatively seldom. However, it is more severe for new 



multi-mode communication systems with codec mode adaptation. High adaptation 
perfonnance requires much more frequent information transmission (adaptation 
data) over the inactive link compared to the transmission of SID frames in current 
systems. 

Moreover, there are certain upper limits of the radio channel activity 
during inactivity (e.g., the AMR system requirement is: TCH/AFS: 16 TDMA 
frames per 480 ms multiframe; TCH/AHS: 12 TDMA frames per 480 ms 
multiframe). Wasting half of the available radio resource would mean that codec 
mode information could only be transmitted half frequently than principally 
possible. The result is a potential performance loss due to slower codec mode 
adaptation. 

A further disadvantage of applying the same diagonal interleaving for SID 
frames (carrying codec mode information) as for speech frames is the delay caused 
by this kind of interleaving. With respect to achieving the best possible 
performance of codec mode adaptation of the multi mode communication system, 
transmission delay of codec mode information should be kept at a minimum. This 
prohibits the usage of diagonal interleaving. 

A particular problem in systems with DTX is the detection of speech 
onsets after periods of inactivity. Missing the onset results in clipped speech 
ou^ut of the decoder. On the other hand, if a non-transmitted frame is 



erroneously detected as a speech onset frame, undesirable plop or bang sounds can 
be produced which can degrade communication quality considerably. 

In principle, AMR systems with DTX operation merely need to transmit 
codec mode requests for the currently active link over the inactive link. No codec 
mode indications for the inactive link need be transmitted. However, \yhen the 
inactive link becomes active again, a suitable codec mode must be selected. A 
solution of how to select the codec mode for speech onsets after inactivity has to 
be found which ensiu-es that transmitting and receiving side apply the same mode. 
Moreover, this codec mode should be suitable with respect to the current radio 
channel conditions. 

Apart from the codec mode signaling method in the AMR standard, so far 
no further fast control channels are available. However, there is a need for such a 
channel in order to be able to perform fast configuration changes (e.g., to change 
an active codec set, to change the phase of codec mode information in order to 
minimize transmission delay, to handover to an existing GSM codec such as FR, 
EFR, or HR, and/or to switch to a future application such as a wideband codec, 
speech and data, or multi-media). 

Accordingly, there is a need for improved methods and apparatus for 
performing DTX and configuration changes in adaptive multi-rate systems. 
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.^mmmary of the Inventdon 

The present invention fulfills the above-described and other needs by 
providing novel solutions for DTX and fast in-band signaling of configuration 
changes and protocol messages, as well as the interaction of both operations, in 
the context of adaptive multi-rate systems. Advantageously, the disclosed 
methods and apparatus are cost efficient in terms of radio transmission capacity, 
in terms of fixed line transmission, and in terms of implementation effort. 

An exemplary method for performing discontinuous transmission (DTX) in 
a communications system in which source data is interleaved for -transmission 
from a first component in the system to a second component in the system 
includes the steps of detecting periods of source data ioactivity, and transmitting 
silence descriptor (SID) frames from the first to the second conq}onent during the 
periods of source data inactivity, wherein certain of the transmitted SID frames 
are interleaved using a different interleaving algorithm as compared to that used 
for source data. For example, the source data can be block diagonally 
interleaved, and certain of the SID frames can be block interleaved. 

The exemplary method can further include the steps of transmitting a first 
type of SID frame to indicate a transition from source data activity to source data 
inactivity, periodically transmitting a second type of SID frame during source data 
inactivity, and transmitting a third type of SID frame to indicate a transition from 
source data inactivity to source data activity. Advantageously, where the 
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conrniunicatioiis system is an adaptive multi-rate (AMR) system, the SED frames 
can include codec mode information in addition to silence description information. 

An exemplary method for transmitting protocol messages from a first 
component to a second component in a speech communications system includes 
the step of transmitting an escape frame in place of a speech data frame, the 
escape frame including a gross bit pattern to distinguish the escape frame from 
speech data frames and conveying a protocol message. The escape frame can 
further include a data field to indicate to the second conq)onent a particular 
protocol message. 

An exenqjlary method for effecting configuration changes in a 
commimications system includes the step of transmitting an escape frame in place 
of a speech data frame, the escape frame including a gross bit pattern to 
distinguish the escape frame from speech data frames and conveying a 
configuration change indication. The escape frame can further include a data field 
to indicate to the second component a particular configuration change to be made. 

For example, where the communications system is an AMR system, an 
escape frame can be used to change an active codec mode set. Alternatively, an 
escape frame can be used to change a phase of codec information. 

The above-described and other features and advantages of the invention are 
explained in detail hereinafter with reference to the illustrative examples shown in 
the accompanying drawings. Those of skill in the art will appreciate that the 
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described embodiments are provided for purposes of illustration and understanding 
and that numerous equivalent embodiments are contemplated herein. 

Brief Descriptioa of the Drawing s 

Figure 1 depicts an exemplary full-rate silence descriptor (SID) frame 
transmission scheme according. 

Figure 2 depicts an exemplary half-rate silence descriptor (SID) frame 
transmission scheme. 

Figure 3 depicts an exemplary adaptive multi-rate communications system 
in which the present invention can be implemented. 

Figure 4 depicts an exemplary SID frame format according to the present 
invention. 

Figure 5 depicts an exemplary full-rate SID frame interleaving scheme 
according to the present invention. 

Figure 6 depicts an exemplary half-rate SID frame interleaving scheme 
according to the present invention. 

Figure 7 depicts an exemplary first-SID frame format according to the 
present invention. 

Figure 8 depicts an exemplary speech-onset frame format according to the 
present invention. 



-13- 

Figure 9 depicts an exemplary scheme for inhibiting first-SID frames 

according to the present invention. 

Figure 10 depicts an exemplary scheme for inhibiting regular SID frames 

according to the present invention. 

Figure 1 1 depicts an exemplary full-rate scheme for detecting transitions 

from speech inactivity to speech activity according to the present invention- 
Figure 12 depicts an exemplary half-rate scheme for detecting transitions 

from speech inactivity to speech activity according to the present invention. 

Figure 13 depicts an exemplary fall-rate scheme for detecting a speech 

onset when a speech-onset indication frame is replaced by a system configuration 

change frame according to the present invention. 

Figure 14 depicts an exemplary half-rate scheme for detecting a speech 

onset when a speech-onset indication frame is replaced by a system configuration 

change frame according to the present invention. 

Detailed Description of the Invention 

Although embodiments of the invention are described hereinafter with 
respect to speech transmission in the GSM system, those of skill in the art will 
immediately appreciate that the disclosed techniques are equally applicable in 
other contexts. For example, the invention is readily applied in any wireless or 
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fixed-line conmumication system, including TDMA systems (e.g., D-AMPS), 
PDC, IS95, and the Internet. 

Figure 3 depicts an exemplary AMR system in which the techniques of the 
present invention can be implemented. The exemplary AMR system includes a 
Transcoding and Rate Adaption Unit (TRAU) and a Base Station (BTS) on the 
network side, as well as a Mobile Station (MS). On the network side, a speech 
encoder (SPE) and a channel encoder (CHE), as well as a channel decoder (CHD) 
and a speech decoder (SPD), are connected via the well known serial A-bis 
interface. For each link, qxiality information is derived by estimating the current 
channel state. Based on the channel state, and also taking into consideration 
possible constraints firom network control, the codec mode control, which is 
located on the network side, selects the codec modes to be applied. 

The channel mode to use (TCH/AFS or TCH/AHS) is controlled by the 
network. Uplink and downlink always apply the same chaimel mode. 
For codec mode adaptation, the receiving side performs link quality measurements 
of the incoming link. The measurements are processed yielding a Quality 
Indicator. For uplink adaptation, the Quality Indicator is directly fed into the UL 
mode control unit. This unit compares the Quality Indicator with certain 
thresholds and generates, also considering possible constraints from network 
control, a Codec Mode Comunaand indicating the codec mode to be used on the 
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uplink. The Codec Mode Command is then transmitted in-band to the mobUe side 
where the incoming speech signal is encoded in the corresponding codec mode. 

For downlink adaptation, the DL Mode Request Generator within the 
mobile compares the DL QuaUty indicator with certain thresholds and generates a 
Codec Mode Request indicating die preferred codec mode for the downlink. The 
Codec Mode Request is transmitted in-band to the network side where it is fed 
into the DL Mode Control unit. This unit generaUy grants the requested mode. 
However, considering possible constraints from network control, it can also 
override the request. The resulting codec mode is then applied for encoding of the 
incoming speech signal in downlink direction. 

Both for uplink and downlink, the presently applied codec mode is 
transmitted in-band as Codec Mode Indication together with the coded speech 
data. At the decoder, the Codec Mode Indication is decoded and applied for 
decoding of the received speech data. 

Codec mode selection is done from a set of codec modes (ACS, Active 
Codec Set), which may include 1 to 4 AMR codec modes. Associated with this set 
is a list of 1 to 3 switching thresholds and hysteresises used by die DL Mode 
Request Generator and the UL mode control unit to generate die Codec Mode 
Requests and Codec Mode Commands. These configuration parameters (ACS, 
thresholds, hysteresises) are defined at caU setup and can be modified at handover 
or during a call. 
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According to the inventioii, DTX in a system such as that shown m 
Figure 3 is based on in-band signaling with three different frame types: 
SrD_FIRST, regular SID, and speech onset frames. These frame types have m 
common that they use particular gross bit patterns, which identify them. 
Moreover, they can also convey payload data, which consists of CN parameters 
and codec mode information. For exan^Ie implementations according to the 
invention, see GSM 05.03: Digital cellular telecommunications system (Phase 
2+); Channel coding (draft ETSI EN 300 909 ¥7.2.0 (1999-11)) and GSM 06.93 
Digital cellular telecommunication system (I%ase 2-f-); Discontinuous 
Transmission (DTX) for Adaptive Multi-Rate (AMR) speech traffic chamiels 
(draft ETSI EN 301 707 V.7.2.0 (1999-11)), each of which is mcorporated herein 
in its entirety by reference. 

SID frames are identified on gross bit level. SID frames are defined to be 
transmitted using k TDMA fi^es, i.e. they consist of ife * 114 bits. A suitable 
choice for * is 4. In fliis case SID frames consist of 456 bits, i.e. of one channel 
frame of 456 bits for TCH/AFS and of two channel frames of 228 bits each for 
TCH/AHS. Each SID frame has a SED frame identification field containing a 
unique bit pattern and two message fields. The one message field is reserved for 
channel encoded comfort noise (CN) parameters, the other for channel encoded 
codec mode mformation. The codec mode information field can carry codec mode 
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requests only, or it can be further subdivided into two parts, one carrying codec 
mode requests/commands and the other carrying codec mode indications. 

An example of the regular SID frame format definition is given in 
Figure 4. In this example, the SID frame consists of a SID frame identifier of 212 
bits, a field of 212 bits for the comfort noise parameters and a field of 32 bits for 
the codec mode information. In this example, it is assumed that the CN 
parameters are convolutionally encoded and the codec mode information consists 
of block encoded requests/conmiands and mdications. In an alternative solution 
the two message fields can be put together, if, e.g., CN parameters and codec 
mode mformation are both encoded using the same convolutional or block code. 

According to the invention, regular SID frames are block interleaved 
rather dian diagonally. While this gives away possible interleaving gain (i.e., the 
transmission is potentially less robust against transmission errors), SID frames 
generally cany less information than regular speech frames, and they can 
therefore be protected using more powerfiil channel codes than used for speech 
transmission. This compensates for the loss m interleaver gain, or even makes 
SID frame transmission more robust than is possible for current solutions (GSM 
FR, EFR, or HR). Important information like codec mode information can, e.g., 
be protected by stronger channel codes (compared to the in-band transmission of 
codec mode information in regular speech frames). Furthermore, CN parameters 
are usually represented with much fewer bits than speech parameters. The few 
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CN bits can thus be protected with lower rate channel codes. As an exaiiq>Ie, out 
of a number of 35 CN bits, aU can be protected, first, by a 14 bit CRC code 
(which makes possible very powerful error detection), and then by using a rate Va 
convolutional code (constraint length *=5). Moreover, both CN parameters and 
codec mode information is generally relatively slowly varying information. Also 
taking into account the proposed SID frame rate (of every 8th frame), which is 
much higher than in existing solutions, occasional losses of a SID frame due to 
channel errors are even tolerable. 

As shown in the respective Figures 5 and 6. for both TCH/AFS and 
TCH/AHS, SID frames consisting of 4 * 114 bits are mapped, according to the 
invention, by block interleaving onto 4 TDMA frames. The purpose of the 
interleaver is to distribute the SID frame bits in such a way onto the available 
TDMA firames that the robustness against transmission errors is maximized. The 
diagonal interleaver for speech frames is not used. As de-interleaving is not very 
demanding m terms of complexity, this solution with a particular block interleaver 
for SID frames is feasible. In the worst case, flie decoder executes both SID 
frame block de-interleaving and conventional speech frame diagonal de- 
interleaving, but not more than one channel decoder. Advantageously, the 
problem m current GSM and PDC systems of wasted bits in the TDMA frames 
belonging to SID frames is thus solved. 
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For TCH/AFS, the actual block interleaving scheme for the SID frame is 
of relatively minor importance. In order to get a maximal interleaver gain, 
identification marker bits, as well as CN and codec mode information bits are 
distributed as equaUy as possible on the TDMA frames used for transmission. 

For TCH/AHS, special cases can occur due to the feet that the SBD frame 
is transmitted using 2 channel frames. As discussed in detail below with respect 
to SID inhibit frames, the situation can occur when the first half of the TDMA 
frames carrying the SID frame has been transmitted and the second half cannot be 
transmitted due to a speech onset. For this case, it is important to be able to 
inhibit the SID pattern, which has afready been sent. This is ensured by 
transmitting the second half of the pattern bits on the odd positions of the second 
half of the TDMA frames. With respect to the codec mode information, it is 
important that the codec mode to be used for decoding the speech onset is 
available. This can be ensured by also transmitting the second half of the codec 
mode indication bits on the odd positions of the second half of the TDMA fi^es. 

A possible solution is to map the pattern bits and the codec mode 
indication bits on the TDMA frames by using diagonal interleaving. 
Consequently, the CN bits and codec mode request/command bits are transmitted 
in the odd positions of the first half of die TDMA frames and in the even positions 
of the second half of the TDMA frames. The described interleaving scheme for 
SID frames on TCH/AHS is illustrated in Figure 6. 
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According to the invention, particular SID_FIRST frames are transmitted 
inmiediately after the last speech frame when going from activity to inactivity. 
The solution is merely to identify end of speech rather than also transmitting CN 
parameters. An example solution for TCH/AFS is to use a 228 bit field consistmg 
of 212 marker bits and 16 bits for codec mode information, as shown in Figure 7. 
The codec mode mfonnation is either requ^t/command or indication, depending 
on what is in turn (if a speech frame had been transmitted). The type of codec 
mode information transmitted with the SID_FIRST frame thus depends on the 
frame number and the transmission phase of the codec mode information. A 
special interleaver m^s the SID_FIRST frame onto the 228 bits avaUable m the 
unused half bursts. Figure 5 illustrates the described transmission scheme of the 
SID_FIRST frame for TCH/AFS. Note there are no longer wasted half bursts. 

An analogue solution for TCH/AHS would transmit a SID FERST 
identification pattern and codec mode mformation on the 2 available usuaUy 
unused half bursts. An example which makes the detection of SID_FIRST more 
reliable is, however, to use also the next 2 TDMA frames. This means that 2 
channel frames SID_FIRST_1 and SID_FIRST_2 are transmitted. A possibly 
identical 228 bit frame as it is used m the TCH/AFS example solution (consisting 
of 212 marker bits and 16 bits for codec mode information, see Figure 7) is 
mapped on tiie even positions of die TDMA frames, which carry die last speech 
frame (unused half bursts), and on the odd positions of die two subsequent TDMA 
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frames. This kind of diagonal mapping allows appUcation of the existing diagonal 
(de-)interleaver. The codec mode information is either request/command or 
indication, depending on the frame number and the transmission phase of the 
codec mode information. Transmitted is that kind of codec mode information, 
which would have been sent in die respective channel frame if speech had been 
transmitted. The mapping is done in a way that equal portions both of the pattern 
bits and the codec mode information bits are put on the first two and the second 
two used TDMA frames. 

Figure 6 illustrates a technique for increasing the reliability of SID FIRST 
frame detection still further. According to the invention, the even positions of the 
additional two TDMA frames are filled with an additional identification pattern. 
It is also possible to use a part of diese half bursts for the transmission of codec 
mode information. The identification pattern could also be the code word of the 
codec mode information, repeated that often that all available bits are used. If. 
e.g., 114 bits are available and the code word for the codec mode mformation is 
16 bits wide, then it could be repeated 114/16 times. 

The diagonal interleaving used for speech frames implies that the odd 
positions of the first half of the TDMA frames carrying the first speech frame 
after an inactivity period are free for oUier purposes. A solution improving onset 
detection, according to the invention, is to fill these bits with a special onset 
identification pattern. Moreover, parts of these bits can also be used for 



-22- 

transmission of a codec mode indication that signals the codec mode according to 
which the first speech frame is encoded. A solution which both conveys an onset 
bit pattern and the codec mode indication is to repeat the codec mode indication 
code word that often that aU available bits are used, as is Ulustrated in Figure 8. 
An example for TCH/AFS is to repeat the 16 bit code word of the indication 
228/16 times. For TCH/AHS, the 16 bit code word is repeated 114/16 times. 
Such an onset frame is mapped by a particular interleaver onto the otherwise 
unused half bursts. The respective frame ttansmission schemes both for 
TCH/AFS and TCH/AHS are depicted in Figures 5 and 6. 

For TCH/AHS. rejgular SID frames and SID_FIRST frames are transmitted 
using 2 channel frames. Situations can thus occur in which a higher prioritized 
speech onset is ttansmitted after die first but before the second channel frame of 
the SID frame has been transmitted. In such a case, the error event could happen 
that the receiver misses the onset and instead detects a SID or, respectively, a 
SID FIRST frame, even though it has actually only received the first half of it. 

To help avoid this problem, a special SID FIRST mhibit frame is used 
instead of a regular onset frame when the first half of the TDMA frames carrying 
the SID_FIRST have been sent but the second half cannot be sent due to a speech 
onset. The pattern bits belonging to the second half of die SID FIRST frame, 
which would have been ttansmitted, are now inverted. This inhibits the detection 
of die whole SID_FIRST pattern at the receiver. The codec mode information bits 
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remain the same, as from the original SID_FIRST frame. The receiver wiU get ai 
unusable frame in the described situation. It is useful to hide this frame by 
applying proper error conceahnent (EC) techniques. The described case is 
illustrated in Figure 9. 

Another special frame, namely a SID inhibit frame, is used instead of a 
regular SID frame when the &st half of the TDMA frames carrying the SID have 
been sent but the second half cannot be sent due to a speech onset. The pattern 
bits belonging to the second half of the SID frame, which would have been 
transmitted, are now inverted. This inhibits the detection of the whole SID 
pattern at the receiver. The codec mode information bits, which represent a codec 
mode indication, remain the same, as from the original SID frame. The receiver 
wiU get an unusable frame m the described situation, for which it will continue to 
generate CN using the previous CN parameters. The receiver can also check for 
the patterns that are transmitted in this special case m order to detect speech 
onsets with improved reliability. The described case is iUustrated m Figure 10. 

According to the invention, SID frames are transmitted during inactivity 
every /i^ frames (TCH/AFS) and, respectively, every «h„ frames (TCH/AHS). A 
suitable choice isnpg = n^- 8. Phase-aligned transmission and decoding of SID 
frames (alignment deduced from the SACCH, as in the current GSM system) is 
one solution existing in today's GSM system, which helps to achieve good SID 
frame decodmg performance. However, the proposed SID frame identification 
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based on gross bit patterns provides that high SID frame detection performance, 
that more flexible solutions without a fixed phase are possible. 

One example is to start transmission of SID frames with the third fi^une 
after the transmission of the SID JFIRST pattern, and then to transmit SID frames 
every 8th frame. An alternative solution is asynchronous SID transmission (i.e., 
not aligned to any fixed tune structure). As an exan^le, SID frames are 
transmitted whenever a mode request changes, possibly with the constraint that a 
certain maximum of transmitted TDMA frames per 480ms multiframe has not yet 
been exceeded. Another enhanced solution can transmit a SID frame if the CN 
parameters have changed significantly and certain maximum of transmitted TDMA 
frames per 480 ms multiframe has not yet been exceeded. Such solutions with 
asynchronous SID frame transmission can fall back to time aligned transmission 
whenever certain minimum transmission requirements per time interval have not 
been met. 

Note that the different bit patterns which are sent for identifying the 
different frame type^ can partially be corrupted by transmission errors. In order 
to ensure reliable detection of the patterns also in the presence of channel errors, 
correlation techniques can be used. One possible solution is to coxmt the niraiber 
of matching bits, when comparing the received bits with the patterns. As an 
example, if 70% of the bits coincide, then the receiver can regard the pattern as 
foimd. An alternative solution using soft bit information is to accimaulate the 
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received soft bits with a positive sign if the corresponding bit of the pattern is 1 
and with a negative sign if the corresponding bit is 0. This accumulated measure 
can be normalized by the product of length of the pattern and the maximal 
possible soft bit value. If the normalized measure exceeds a certain threshold, 
e.g. 0.4, the receiver can regard the pattern as found. 

One ftirther criterion, which can be used for SID frames, is the CRC of the 
CN bits. If there is a CRC error, the fi-ame is not regarded as a valid SID frame. 

For cost reasons, it is desirable that the identification patterns do not 
require much memory for storing them. As an example, the identification pattern 
for SID_FIRST and regular SID for TCH/AFS can be constructed by repeating 
short 9-bit sequences ceil((228-16)/9) = 24 times and then discarding the last 4 
bits. Such a 9-bit sequence is. e.g., (0, 1, 0, 0, 1, 1, 1, 1, 0}. 

For THS/AHS, it is ftuther important to avoid the possible decoding of a 
SID_FIRST flrame as a regular SID. frame, and vice-versa. Therefore, the 
identification patterns for SID and SID_FIRST are made as distinct as possible. 

As an example, the pattern for the SID^FIRST frame can be identical to 
the pattern used for TCH/AFS. The pattern used for regular SID frames can then 
be constructed by inverting the SID FIRST pattern. 

The solution to transmit only a special bit pattern and codec mode 
information in the SID_FIRST frame rather than also transmitting CN parameters 
helps to keep the DTX efficiency at a maximum (i.e. , the activity on die air 
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interface is kept at a minimum). At the same time, the detection reliability of the 
identification pattern can be maximal since all available bits are used for the bit 
pattern (except those used for transmission of codec mode infomfiation). A 
problem with this is, however, that the receiver does not get a set of CN 
parameters for CN generation during the period firom end of speech imtil the 
reception of the first regular SID frame. The solution is to derive the CN 
parameters locally in the receiver by using the speech parameters of the last n 
frames before end of speech. Usually, the encoder operates with hang-over, i.e. 
even though the VAD detects voice inactivity, a certain number of m frames is 
still encoded as speech. The decoder can thus derive CN parameters locally by, 
e^g., averaging the gain and spectral parameters of the hangover frames, i.e. n = 
m. Another solution is to apply the last received set of CN parameters of a 
preceding inactivity period. 

According to the invention, an AMR receiver incorporates a 2-state model 
with the states activity and inactivity. The purpose of this state model is to 
support the speech/SID/non-transmitted frame distinction. Going from activity to 
inactivity requires the detection of a SID_FIRST frame following speech frames. 

Going from the inactivity to the activity state requires to detect the speech 
onset identification pattern, and a valid first speech frame which can be decoded 
without CRC error and, optionally, which exhibits quality measurements that are, 
e.g., derived from the receiver/channel decoder and which exceed certain 
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thresholds. An example is the SFQ measure (gross bit error estimate), which 
must be below some threshold. The reliability of this state transition can be 
increased with the constraint that more than one frames must be decodeable 
without CRC error, and, optionally, not exceeding a certain SFQ measure. 
Another criterion, as illustrated in Figure 11, helping to properly detect transitions 
from inactivity to the activity is that received frames immediately following SID 
frames can never be a speech frame, provided that block interleaving is used for 
SID frames which requires less delay than diagonal interleaving for speech 
frames. Figure 12 illustrates this criterion for the example of TCH/AHS. 

Another way to improve the detection of first speech frames and to help to 
distinguish them from non-transmitted frames is to access measures from other 
components of the receiver (e.g., the RF receiver or the equalizer). Examples for 
such measures are carrier and interferer strength estimates and derived measures 
such as C/I ratio. 

A further way to improve both SID_FIRST and first speech frame 
identification performance is to transmit the TDMA frames carrying them with 
increased transmission power. 

According to the invention, the following solutions are suitable for 
defining the codec mode for speech onsets after a period of inactivity: 

(a) Selection of the most robust codec mode or, alternatively, with the n-th 
robust codec mode. The safest solution is to choose n=l. No codec mode 
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indication need to be transmitted. The drawback for n= 1 is that, for good 
channels, a too robust codec mode with low intrinsic speech quality is selected: 

(b) Selection of the same codec mode as for the currently active link. This 
is motivated by the fact that uplink and downlink channel qualities are similar. 
The transmitting side of the link resuming speech transmission applies the codec 
mode which it is requesting for the incoming currently active link. The receiving 
side of the link becoming active again knows the applied codec mode as it is 
identical with the codec mode requests which it is receiving for application on the 
outgoing currently active link. The scheme can be made more robust if a mode is 
selected for speech onsets which is n (e.g. n=l) modes more robust than the mode 
of the currently active link (provided that such a more robust mode exists). 

(c) Selection of the same codec mode which was selected at the end of the 
last speech period preceding the inactivity period. This is motivated by the fact 
that radio channel conditions generally do not change very fast. The scheme can 
be made more robust if a mode is selected for speech onsets which is n (e.g. n=l) 
modes more robust than the mode which was used at the end of the last speech 
period (provided that such a more robust mode exists). 

(d) Selection accordmg to measurements of the mactive link. As 
transmission on inactive links is not completely stopped, link quality measuring is 
possible. Corresponding measurement reports or codec mode requests / 
commands are transmitted over the active link. When the inactive link resumes 
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speech transmission, a codec mode corresponding to the last received codec mode 
request is selected. 

Advantageously, solutions (a), (b) and (c) above can make use of the fact 
that no codec mode requests for the inactive link need to be transmitted. The 
active link can thus save the transmission capacity for codec mode request and use 
it for some other purpose. An example is to use this transmission capacity for 
better protected transmission of codec mode indications. 

In addition to the above described techniques for performing DTX in AMR 
systems, the invention further provides techniques for performing fast 
configuration changes in AMR systems. The purpose of these techniques is to 
enable fast configuration changes which cannot be done using existing slow 
control channels. Moreover, existing control channels cannot ensure that 
configuration changes are synchronized with speech data transmission. Like the 
above described DTX mechanism, the configuration change mechanism is based 
on in-band signaling. Applications are, e.g., in connections with tandem free 
operation (TFO), the change of the active codec set and the change of the phase of 
the codec mode information (in order to minimize transmission delay). Further 
general applications are handovers to one of the existing GSM codecs (PR, EFR, 
HR), or to switch to a future application as, e.g. a wideband codec, speech and 
data, or multi-media. Like the DTX mechanism, the configuration change 
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mechanism is described with respect to TCH/AFS and TCH/AHS in the GSM 
system, but is equally applicable in other contexts. 

The configuration change mechanism is based on frame stealing similar to 
the weU known FACCH frame stealing (i.e., speech frames are replaced by 
configuration change frames), and is therefore referred to hereinafter as escape 
signaling. Since the escape signaling mechanism is used only occasionaUy during 
a connection and only few speech frames will be stolen, the error concealment 
unit at the receiver is able to make the frame stealing virtually inaudible. 

According to the invention, escape frames are of similar format as the SID 
frames described above, they are identified on gross bit level by a particular 
identification pattern. Like SID frames, they include this pattern and one or two 
message fields. One field carries the actual channel encoded escape message, the 
other codec mode information. As an example, the escape frame can include 456 
bits and be of exactly the same frame format as SID frames (see, for example, 
Figure 4), where the CN field is replaced by the escape message. 

The payload to be transmitted by the escape mechanism is caUed the escape 
message. Escape messages constitute of a number of net bits, which can be 
grouped to logical units. For example implementations according to the 
invention, see GSM 05.09: Digital cellular telecommunication system (Phase 2+); 
Link Adaptation (draft ETSI EN 301 709 V7.1.0 (1999-11), which is incorporated 
herein in its entirety by reference. 
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Escape messages can be chaimel encoded with any suitable channel coding 
scheme, as e.g. block or convolutional coding. One cost efficient solution is to 
use exactly the same channel coding as used for the CN parameters in the SID 
frame as described above. This means. foUowing the above described example 
solution with 35 CN bits, that an escape message of 35 net bits is protected with a 
14 bit CRC and then convolutionaUy encoded with a code rate of 1/4 and 
constraint length it= 5. 

As with die Sm frame, the codec mode information field can carry both 

block or convolutionaUy encoded codec mode indication and codec mode 

command/request. 

Escape frames are block diagonally interleaved, like speech frames. This 
implies, assuming the example solution with an escape frame of 456 gross bits, 
that an escape frame replaces one speech frame on TCH/AFS and two speech 
frames on TCH/AHS. 

For TCH/AHS, these are not necessarily two consecutive frames, however 
it is assmned in the described example solution. Not stealing two consecutive 
frames is advantageous for the error conceahnent in order to hide the stealing. On 
flie other hand, stealing two consecutive speech frames is beneficial in terms of 
fransmission delay of the escape message. The interleaving is done in such a way, 
that die first half of the escape frame (228 bits, see Figure 4) replace die first 
speech frame. It is important that this first half contains the escape identification 
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pattem. This enables the receiver to check for this pattern. After finding the 
pattern, the receiver is able to locate the second stolen speech frame, which 
carries the second half of the escape frame. 

In order not to interfere with the regular transmission of codec mode 
information, the interleaver can further map one of the codec mode information 
code words on bit positions of the first stolen speech frame. Consequently, the 
other codec mode information code word is mapped on bit positions of the second 
stolen speech frame. Furthermore, the placing of codec mode information, i.e. 
codec mode indication and requests/commands into the codec mode field is done 
with respect to the codec mode information phase during transmission of regular 
speech frames. If. e.g.. the first half of the escape frame replaces a speech frame 
which would have carried a codec mode indication, then this first half of the 
escape frame has still to transmit a codec mode indication. 

Note that the above described escape mechanism can also be used in 
conjunction with the above described DTX mechanism. Thus, according to the 
invention, escape frames can replace not only speech frames, but also all other 
types of frames, namely SID_FIRST. regular SID. NoTX. and speech onset 
frames. Considering the case that an escape frame is to be sent during an 
inactivity period, it is efficient in terms of transmission resource usage to apply 
block interleaving, as is done.for SID frames. However, since the escape 
mechanism is targeted to be used only occasionally, transmission resource usage is 
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not the most important criterion. Rather, cost efficient implementation and low 
complexity is important. Therefore, a beneficial solution is to keep the frame 
format, chamiel coding, and block diagonal interleaving, which is also used for 
escape frames during speech. 

Note that using block diagonal interleaving for escape frames during DTX 
implies that there are half bursts not defined by the interleaving. For TCH/AFS. 
the odd positions of the first 4 bursts and the even positions of the last 4 bursts 
carrying the esc^ frame are undefined. Undefined bits are no problem per se. 
however, the foUowing problem can be solved by setting the undefined positions 
appropriately. Consider the case of a speech onset. As described above, a speech 
onset frame is marked with an onset pattern which both allows for better 
identifying the frame as an onset and identifying the codec mode used for the 
onset speech frame. If an escape frame must be sent at the same time, it will 
replace the onset frame. Thus, for subsequent speech frames, it is more difficult . 
to identify them as speech frames, since the onset pattern was stolen. 

According to the invention, this problem is avoided by fiUing the first half 
of the undefined bits (odd positions) with the onset pattern, regardless if there is 
an onset or not. For the case that there was actuaUy no onset, it needs to be 
signaled that inactivity continues. Sending SID_FmST immediately following the 
escape frame does this. This defines the second half of the otherwise unused bits 
(even positions). This solution is further beneficial in terms of implementation 
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costs. It allows handling of escape frames, apart from channel codmg, exactly as 
if it was speech. Figures 13 and 14 illustrate the described solution with respect 
to TCH/AFS and tCH/AHS, respectively. 

Note that speech frames, which have been stolen for escape purposes, 
cannot be rescheduled for transmission after the escape, since this would increase 
the speech transmission delay. However, SID frames that are affected by the 
escape frame transmission can be rescheduled for transmission immediately after 
esc^e frame transmission. Advantageously, this helps to maintain a high 
subjective comfort noise signal quality. Example solutions are provided in the 
above cited GSM 06.93. 

In order to ensure correct reception of escape messages and to define 
appropriate routines for error events, an escape protocol is proposed. Example 
solutions are provided in the above cited GSM 05.09. 

Those skilled in the art will appreciate that the present invention is not 
limited to the specific exemplary embodiments which have been described herein 
for purposes of illustration and that mmierous alternative embodunents are also 
contemplated. The scope of the invention is therefore defined by the claims 
appended hereto, rather than the foregoing description, and all equivalents which 
are consistent with the meaning of the claims are intended to be embraced therein. 



